Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 45211 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 29.2 MiB |
| Average record size in memory | 677.2 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 6 |
| Boolean | 4 |
pdays is highly correlated with poutcome | High correlation |
previous is highly correlated with pdays | High correlation |
housing is highly correlated with month | High correlation |
contact is highly correlated with month | High correlation |
month is highly correlated with housing and 2 other fields | High correlation |
poutcome is highly correlated with pdays | High correlation |
age is highly correlated with job | High correlation |
job is highly correlated with age and 1 other fields | High correlation |
education is highly correlated with job | High correlation |
day is highly correlated with month | High correlation |
previous is highly skewed (γ1 = 41.84645447) | Skewed |
balance has 3514 (7.8%) zeros | Zeros |
previous has 36954 (81.7%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-20 17:46:13.878186 |
|---|---|
| Analysis finished | 2022-10-20 17:46:27.899102 |
| Duration | 14.02 seconds |
| Software version | pandas-profiling v3.3.1 |
| Download configuration | config.json |